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Suppose each site on a one-dimensional chain with periodic boundary condition may take on any 
one of the states 0, 1, . . . ,n — 1, can you find out the most frequently occurring state using cellular 
automaton? Here, we prove that while the above density classification task cannot be resolved by a 
single cellular automaton, this task can be performed efficiently by applying two cellular automaton 
rules in succession. 
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Cellular automaton (CA) is a simple local parallel in- 
teraction model for many natural systems And 
from the computer science point of view, CA can be re- 
garded as a special kind of Turing machine without in- 
ternal memory. In fact, tailor-made CA can be used to 
simulate certain logical operations H . 

Since CA is essentially an internal-memoryless Turing 
machine, it is natural to ask if it can be used to perform 
certain tasks that require global counters. One such task, 
called (binary) density classification, was recently studied 
by Land and Belew Q . They consider a one-dimensional 
(finite but arbitrarily long) chain of sites with periodic 
boundary condition. Each site is in cither state zero or 
state one. That is, each site contains a Boolean state. 
Our task is to change the state of every site to one if 
the number of ones is more than the number of zeros in 
the chain. (That is, the density of one, p c , in the chain 
is greater than 1/2.) Otherwise, every site is set to the 
state zero. Clearly, the density classification problem is 
trivial if one uses a global counter. Alternatively, one can 
also solve this problem if the CA rule table scales with 
the number of sites N so that the CA model becomes 
nonlocal in the limit of large N. Nonetheless, Land and 
Belew proved that (binary) density classification cannot 
be done perfectly using a single one-dimensional CA jjj . 
Their proof can be extended to multiple dimensions, too. 

Quite unexpectedly, Fuks found that the (binary) den- 
sity classification problem can be solved if we apply two 
CA rules in succession ||. Later on, Chau et al. gener- 
alized his result by showing that classifying any rational 
density p c on a N site chain can be performed efficiently 
in O(N) time using two CA rules in succession [||. 

At this point, it is natural to ask if CA can be used to 
classify nary density, namely, when each site takes on a 
nary state. In fact, Land and Belew conjectured that a 
significantly different way of argument is required to gen- 
eralize their "no-go" result to the nary case [Q. In this 
Letter, we first prove that nary density classification by 



CA is impossible. Although we only report our proof for 
the one-dimensional case, our arguments can be readily 
generalized to multiple dimensions. But most important 
of all, our proof provides a hint to solve the nary density 
classification problem using two CA rules. Guided by 
this hint, we report a simple and efficient way to classify 
nary density with two CA rules. 

Statement of the problem — We begin by formally 
defining the nary density classification problem. Sup- 
pose that each of the one-dimensional chain of N sites 
in periodic boundary condition may take on a state in 
0, 1, . . . , n— 1. We define Pi(a) as the number of sites in 
state i for the configuration a divided by N. That is, 
Pi{a) is the density of the state i of the configuration a. 
Our goal is to evolve the state of all sites to i if and only 
if Pi(a) > Pj(a) for all j ^ i. While this task is trivial 
if one has a global counter, we now show that this task 
is not achievable using a (deterministic) CA. More pre- 
cisely, we mean that given any deterministic CA with a 
finite rule table, we can find a configuration with a suf- 
ficiently large N such that this configuration cannot be 
correctly classified by this CA. 

Impossibility to classify density by one CA rule — We 
prove our impossibility result by contradiction. Suppose 
a CA rule with radius r correctly classifies any nary den- 
sity for all N, then we denote the action of this CA on a 
configuration a by T. Moreover, we denote the set of all 
configurations with density pi > pj for all j ^ i by f2j. 
Then, Land and Belew showed that || 

Lemma 1 T [fij C 4 for all i. Besides, T [j N ] = j N 
where j N denotes a configuration of N consecutive js. In 
fact, if the states of a site and its 2r neighbors are all j , 
then the state of that site under the action of T equals 
j . Similarly, if a is a period 2r + 1 sequence, then so is 
T(a). 

Now, we make the following claim: 
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Claim 1 The action of T on any configuration a pre- 
serves the densities pi of the configuration a. 

Clearly, our conclusion that no such CA rule T exits 
follows directly from Claim [l] because the state 00 ... 001 
cannot be brought to 00 . . . 000 under the repeated action 
of T. 

Proof of the claim: Suppose the contrary, then we can 
find a configuration (3 whose density p t is not preserved 
under the action of T for some i. Without lost of 
generality, we may assume that po(0) > po(T(/3)) and 
Pi(/3) < pi(T(/3)). Otherwise the proof is similar. If we 
denote the length of (3 by s, then p (T(/3)) < p ([3)-l/s. 
From Lemma ^, wc know that f3 must make up of more 
than one type of states. Now, we consider configurations 
in the form 7 = f3 x V 1 Z ; that is, 7 makes up of x copies 
of /3 followed by y consecutive zeros and then z consec- 
utive ones. We choose the ratio x : y : z in such a way 
that 7 G ^0 while pi (7) < p (7) < Pi (7) + 2{sx + v+z ) - 
In other words, 7 has a slight excess of zeros over ones 
to put it in fio- Now, we consider the state T(7). Since 
the radius of our CA equals r, so except for the 6r sites 
at the boundaries between fj, and 1, Lemma [j] tells 
us that states of all other sites containing V 1 Z are un- 
changed by the action of T. Consequently, p (T(j)) < 
Po(l) - + j^+- z - So, if we choose sufficiently 

large values of x, y and z while keeping the ratio x:y:z 
fixed, we can make p (T(-y)) < p (-f) - 2{sx + y+z) while 
pi(T( 7 )) > Pl ( 7 ) + 2{sx + y+z y Therefore, po(T( 7 )) < 
pi(T( 7 )). Hence Tfa) & O although 7 G fi - This 
contradicts Lemma [y. □ 

Clearly, we can modify the above proof to show that 
nary density classification in multiple dimension by a sin- 
gle CA is also impossible. More importantly, Claim [j] 
tells us that in order to construct an nary density classi- 
fication rule using two CAs, the first rule must preserve 
densities. This is the reason why both the traffic rule and 
modified traffic rule used by Fuks || and Chau et al. || 
in binary density classification are density preserving. 

Classification of nary density using two CA rules - 
Let us introduce a few useful definitions before reporting 
the two CA rules that classifies nary density. 

Definition 1 Let a = (ai)fL 1 be a one- dimensional con- 
figuration in periodic boundary condition. An elemen- 
tary block (EB) in a is defined to be a subsequence of 
consecutive states (bj) k j =1 of a with b\ < b 2 < ■ ■ ■ < 6fc. 
Moreover, such a subsequence is maximal in the sense 
that inclusion of further element into the subsequence in 
either ends does not produce any EB. 

Clearly, any configuration a can be uniquely decom- 
posed into a collection of EBs. For instance, the con- 
figuration 40223441 makes up of EBs 02, 234, 4 and 14. 
Furthermore, for an nary state system, elementary com- 



binatorics arguments shows that there are totally 2™ — I 
different possible EBs. 

Definition 2 We define the homogeneous lexicographic 
(hlex) ordering to these 2™ — I EBs as follows: An EB 
(a,j) is greater than another EB (bj) if and only if (1) 
the length of (aj) is greater than that of (bj); or (2) the 
lengths of (aj) and (bj) agree and aj > bj for the first 
index j with aj ^ bj . 

We denote the 2"— 1 EBs by £>;S. Then, the hlex order 
goes as B a = < Uex B\ = 1 < h i C x • ■ ■ <hicx = 

n—1 <hlcx Bn = 01 <hlcx B n +1 = 02 • • • <hlex San-2 = 

0(n-I) < hlcx ••• < hlcx £ 2 "- 2 = 012... (n- 1). We may 
interpret the hlex ordering as a measure of affinity be- 
tween different states. 

Our goal of the first CA rule is to make the EBs to 
interact with each other, hoping to make as many high 
affinity EBs as possible. We achieve this goal by first in- 
troducing a greedy interaction between two adjacent EBs. 
Since the union and intersection of elements of two EBs 
can form EBs. So, we have: 

Definition 3 Let C\C 2 be two adjacent EBs, and we 
abuse the notation a bit by writing C\ and C 2 both as 
sequences and sets. Then, we define L(C\C 2 ) = T>{D 2 
where D\ is the EB formed by the set C\ U C 2 , and T> 2 is 
the (possibly empty) EB formed by the elements inC\C\C 2 . 
In case T>\D 2 equals C\C 2 or C 2 C\, then we said that C\ 
interacts elastically with C 2 . Otherwise, we said that C\ 
interacts inelastically with C 2 ■ 

From the above definition, readers can verify that 
7(2301) = 0123, 7(02123) = 01232, 7(01201) = 01201 
and 7(0101) = 0101. We also observe that 

Lemma 2 The EB T>\ defined above is the highest pos- 
sible EB formed from C\ U C 2 with respected to the hlex 
order. Furthermore, T> 2 <hiex7?i. 

Proof: Since all elements in an EB are distinct, so the 
highest possible affinity EB is formed by selecting all the 
elements in C\ U C 2 . In addition, T> 2 <hicx 2?i follows 
directly from the fact that C\ H C 2 C C\ U C 2 . □ 

Since T>i is the highest possible affinity EB constructed 
out of C\ and C 2 , the greedy interaction is an effective way 
to construct high affinity while keeping the density pi of 
a configuration. Now, we report two technical lemmas 
before going on. 

Lemma 3 Let a be a one- dimensional configuration with 
N sites. Suppose there exist two EBs in a which can in- 
teract inelastically with each other, then the total number 
of EBs in a is less than or equal to N — 2. Besides, this 
bound is tight. 
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Proof: Suppose a contains N EBs, then from Defini- 
tion [|, it is easy to see that states in all the N sites must 
be identical. Hence, none of the EB interacts inelasti- 
cally with each other. Similarly, if a contains N—l EBs, 
then exactly N— 2 of them contains one element and one 
of them contains two elements. Since periodic boundary 
conditions applies to the iV sites, again from Definition [l], 
we know that all the iV — 2 singleton EBs must be in an 
identical state, say a. Suppose the remaining EB is in 
state be, then b < a < c. (Otherwise, we can extend 
the size of the EB be by one, contradicting the maxi- 
mally of an EB.) Now, it is easy to check that a and 
be interact elastically. Thus, an inelastically interacting 
configuration must contain less than N — l EBs. Finally, 
the tightness of this bound is revealed by the configura- 
tion 020100000. □ 

Lemma 4 Let a be a configuration whose EB only in- 
teracts elastically with each other. Then a 6 fli if and 
only if Bi is an EB in a . 

Proof: From Definition [| two EBs interacts elastically 
if and only if one is a subset of the other. Hence, if the 
EBs in a interact elastically, we can always arrange them 
sequentially so that one is a subset of the next. Clearly, 
the most frequently occurring state(s) are the ones in 
the first element of this sequence C. Thus, a £ fij if and 
only if i is the only element in C. Hence, our assertion is 
proved. □ 

At this point, we have introduced enough material to 
present our first CA rule for the nary density classifi- 
cation problem. We express a configuration a in terms 
of its EBs C1C2 ■ ■ • Cj. Inspired by the traffic rule used in 
Ref. j| , we define the basic rules R, for i = 0, 1, . . . , 2 n — 2 
as follows: Each EB Ck C a with Ck = Bi will inter- 
act with its forward neighboring EB Ck+i if and only if 
Ck+i 7^ Bi. Besides, all interactions are taken in paral- 
lel. For instance, Ri(1021101) = 0121011. Clearly, R; 
conserves pj . More importantly, it is straight-forward to 
check that R^ is equivalent to the following rules: (1) If 
Ck-i, Ck ^ Bi or Ck, Ck+i — Bi, then the states of the 
sites occupied by Ck remain unchanged; (2) Otherwise, 
if Ck-i = Bi, then the states of the sites occupied by Ck 
become the last £(Ck) states of I(Ck-iCk) where £(Ck) de- 
notes the length of the sequence Ck ; (3) For the remaining 
case that Ck+i ^ Bi, then the states of the sites occupied 
by Ck become the first £(Ck) states of I(CkCk+i). Hence, 
Hi is a CA rule with radius n+£(B,) — 1. 

Since finite composition of CA rules is also a CA rule, 
we have: 

Definition 4 We write the affinity rule A = Ro o 
(Ro o Ri) o (R o H 1 o R 2 ) o • ■ -o(R o Rj o • • • o R 2 n_ 2 ). 

(That is, a total of (2"- 1 - 1)(2" - 3) terms in the 
above composition.) Then, the CA rule A preserves 



Pi and has a radius between n(2 n 1 — 1)(2" — 3) and 
(2n-l)(2™- 1 -3)(2"-3). 

Note that the mobility of an EB increases with de- 
creasing affinity, so the spirit of our affinity rule is to 
aggressively produce a much high affinity EBs as we can. 
In fact, we find that 

Lemma 5 The total number of EBs of a configuration 
a is greater than or equal to that of A(a). Besides, un- 
der the repeated action of A, the number of EBs in a 
configuration will eventually stay constant. 

Proof: The first part of the lemma follows directly from 
Definitions || and f|. Since N is finite, the total number 
of EBs for any configuration must lie between 1 and N. 
Since the number of EBs is a decreasing function of A 
and n is finite, so under the repeated action of A, the 
number of EBs in a configuration will eventually stay 
constant. □ 

With all the preparation works above, we have confi- 
dent to use A as our first CA rule in the nary density 
classification problem. The power of this CA rule is ap- 
parent from the Theorem below. 

Theorem 1 Let a be a configuration in a one- 
dimensional chain of N sites. Then a € fii if and only 
if A N ~ 2 (a) contains an EB Bi. 

Proof: By Lemma 0, we only need to prove that for any 
configuration a, A (a) does not contain inelastically 
interacting EBs. In addition, we may assume that a 
contains at least a pair of inelastically interacting EBs. 
Otherwise, our assertion is trivially true. 

We follow the motion of an arbitrarily chosen EB un- 
der the action repeated of A. After the EB interacts with 
another one, then we turn to follow the motion of the left 
resultant EB. Besides, we turn to follow the motion of the 
left neighboring EB in case the left neighboring EB equals 
to that of our current EB. So, Lemma ^ tells us that the 
affinity of the EB we are following increases with time. 
In addition, Definition || tells us that EBs with different 
affinity interact at a different rate. More importantly, by 
direct checking, one sees that our monitoring EB must 
interact with the left neighboring EB under the action 
of A. Besides, under the action of A, two distance EBs 
can interact only when one of them first hops through all 
the EBs separating them by elastic interactions. Now, 
we denote the highest affinity EB under our monitoring 
scheme in A N ~ 3 (a) by C\, then from Lemmas |,|and|, 
all EBs in A N ~ 3 (a) are subsets of C 2 . 

Now, we trace the motion of an arbitrarily chosen EBs 
in exactly the same way as before except that when the 
tracing EB interacts with the EB that leads to C\ on its 
way, then we turn to trace the motion of resultant right 
EB. Now, we write C 2 as the highest affinity EB under 
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this new monitoring scheme in A Ar_3 (a). Then, using 
the same argument as before, it is clear that all EBs in 
A N ~ 3 (a) but C\ are subsets of C%. Besides, C 2 C C\. 
Inductively, by tracing the motion of EBs in a similar 
way that leads to C\ and C2, we conclude that the EBs in 
A N ~ 3 (a) interact elastically. Hence, by Lemma |, this 
theorem is proved. □ 

With the help of Theorem [ij the nary density classi- 
fication problem using two CAs can be easily executed. 

Definition 5 We express a configuration as a collection 
of EBs. For any three consecutive EBs C1C2C3, we define 
the propagation rule P as follows: The states of sites in 
C2 remain unchanged if none of the three blocks C\ , C2 
and C3 equals Bi for all < i < n. Otherwise, we set 
the state of sites in C2 to i where i is the state of the 
minimum affinity EB among C\ , C2 and C3 . 

Since we can only find at most one type of Bi (0 < i < 
n) in A N ~ 3 (a), repeatedly applying the propagation rule 
P to A N ~ 3 (a) results in propagating that particular EB 
Bi. In addition, similar to the affinity rule, it is straight- 
forward to show that P is a CA rule with a radius 2n— 1. 
Besides, we have 

Theorem 2 If a is a configuration whose EB only in- 
teracts elastically, then applying P [T /2] times to 
a will result in having all is in the configuration if and 
only if a £ fli . 

Proof: Lemma |i] tells us that a contains the EB Bi if 
and only if a s for < i < n. That is, we can find 
at most one type of EB Bi for < i < n in a. Clearly, 
the worst case occurs when a contains exact one such Bi. 
Besides, all other EBs in a are of length two. In this 
case, it is clear that applying P [[^5-^] /2] times result 
in converting all sites to state i. Hence it is proved. □ 

In summary, we show that it is impossible to solve the 
nary density classification problem in any dimension us- 
ing one CA. Nevertheless, we can solve this problem by 
first applying the affinity rule A (with a radius between 
ji(2 n_1 — 1)(2"-3) and (2n-l)(2"- 1 -l)(2™-3) ) N-3 



times and then followed by the propagation rule P (with 
a radius of 2n— 1) [[-^p-] /2] times. Since the run time 
of the two CA rules scales as O(N), so apart from a con- 
stant speed up, our two CA nary density classification 
rules are optimal. In other words, the dynamics of our 
two CA rules gives rise to n stable fixed points each with 
configuration i , Moreover, we find a large number of 
unstable fixed points whenever there is more than one 
most frequently occurring state in a configuration. Fi- 
nally, we remark that these CA rules are not unique and 
other equally good method exists ||. 

It is instructive to extend the nary density classifica- 
tion problem to rational function classification similar to 
that reported in Ref. as well as to higher dimensional 
lattices. We plan to report these results in future. 
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